Camera Testing Tool 2.0

Overview

Redesigning how camera quality is evaluated at scale.

Microsoft's device quality program validates camera performance across Windows-certified devices before launch. Working across 15+ OEM partners and 80+ image quality metrics, the platform supported large-scale evaluation workflows used to review device readiness and camera consistency.

Camera Testing Tool 2.0 redesigned that workflow to make analysis faster, clearer, and easier to compare across devices.

10M+

Devices evaluated globally

15+

OEM partners

80+

Image quality metrics

Camera Testing Tool 2.0 — interface preview

Platform

Windows desktop validation platform (WinUI)

Company

Microsoft — Device Quality Ecosystem

Role

UX research, IA, workflow design, UI system

Timeline

2025 — Present

Program Manager

Coordinates vendor communication and final quality reviews.

Core frustration

Important results were scattered across multiple reports, making review and reporting slower.

Camera Validation Tester

Runs camera tests and reviews device quality metrics.

Core frustration

Threshold references lived outside the tool, slowing analysis and increasing review errors.

The Problem

A pipeline this critical
was running on manual steps.

Every cycle meant jumping between tools that were never meant to work together — uploading device data in one place, looking up thresholds in a separate document, cross-checking per-device reports, then hand-compiling everything into a vendor deliverable.

Inconsistent pass/fail interpretations across testers
Failures missed at the threshold boundary
Vendor reports took hours to produce manually
The old tool surfaced data — it didn't help anyone make a decision

Insights

Data was visible.
Decision-making was still manual.

Teams had access to the data — but not a clear way to interpret it quickly. Interviews and workflow observation revealed the same issue: analysis was fragmented, manual, and difficult to navigate efficiently.

Finding 01

Work was spread across multiple tools

Testers constantly switched between reports just to compare results across devices.

Finding 02

Metrics lacked clear meaning

Testers could see the numbers, but not whether the results were actually good or bad.

Finding 03

Manual work filled the gaps

Teams relied on spreadsheets and hand-written summaries to complete analysis.

Finding 04

There was no unified overview

Understanding overall device quality required collecting information from multiple places.

Design Goals

Four findings.
Three product goals.

The research revealed a consistent issue: the system showed data, but didn't support fast decision-making. These three goals guided the redesign — from workflow structure to interface behavior.

01

Prioritize critical signals

Surface failures and device health before deep analysis. Important issues should never be buried inside dense tables.

02

Make comparison the default

Users should compare devices in one unified view — without switching tabs or assembling reports manually.

03

Add context to the workflow

Threshold guidance and interpretation should exist directly inside the interface, not in external documents.

User Flows

Four flows. Every decision mapped.

Each flow traces a distinct user goal through the platform — with branching decision logic made explicit at every critical step.

Flow 01 · Primary

Core Test Execution

Flow 02

Multiple Devices Testing

Flow 03

Metrics Analysis

Flow 04

Check Testing Result

Flow 05

Quick Decision

Information Architecture

Four screens.
One sequential workflow.

The product structure was rebuilt around the real validation sequence, so each screen owns a clear decision point and reduces context switching for testers and PMs.

Step 01

File Setup

Batch import — all devices configured together, not one by one.

Step 02

Run & Analyze

Validate and review metrics — thresholds embedded directly in the view.

Step 03

Overview

Full-set health summary — pass rate, failures, vendor comparison at a glance.

Step 04

Export Report

Auto-generated vendor deliverable — no manual copy-paste.

Wireframes

Structure before
visual design.

Low-fidelity layouts validated the IA and interaction model before any visual polish — each screen mapped to one moment in the workflow.

Solution

Four decisions.
Each traced to a finding.

The IA was rebuilt around four sequential screens — each owning exactly one moment in the workflow. Every design choice below maps directly to a validated research finding.

Decision 01

Multi-Device Batch Import

Finding	Testers configured validation runs device-by-device — repeating the same setup steps for every hardware variant in a cycle, introducing inconsistency and wasting time.
Decision	Replaced single-device upload with a unified batch import flow. All devices are configured together in one session, with shared settings applied across the set.
Outcome	Consistent run configurations across devices. Reduced setup friction and eliminated a class of errors caused by per-device manual entry.

Decision 02

Threshold Context Embedded in Results

Finding	Testers couldn't interpret metric values without knowing the threshold — so they constantly switched to an external reference sheet, breaking focus and slowing analysis.
Decision	Redesigned the results view to embed threshold context, pass/fail indicators, and boundary proximity directly alongside every metric value.
Outcome	Testers can read and interpret results without leaving the screen. Faster analysis and fewer missed failures at the boundary.

UI — Results with threshold context (view 1)

UI — Results with threshold context (view 2)

Decision 03

Cross-Device Comparison in One View

Finding	Comparing metric performance across devices required opening separate per-device reports — a fragmented experience that made spotting patterns across a vendor set nearly impossible.
Decision	Introduced a unified comparison layer with tabbed metric categories and device toggles, enabling side-by-side analysis without switching screens.
Outcome	Reduced time-to-insight on cross-device analysis. Decision-critical signals visible at a glance, not buried in sequential reports.

Decision 04

Auto-Generated Vendor Report

Finding	After analysis, PMs manually assembled validation results into a vendor deliverable — copying data out of the tool into a separate document, a process that took hours and varied in format.
Decision	Designed a final reporting layer that transforms validated results directly into a structured, vendor-ready summary. One action — no manual compilation.
Outcome	Report generation goes from hours to minutes. Consistent format across all vendor deliverables, regardless of who creates them.

Design System

Built for data density,
not decoration.

Every token was chosen to reduce visual noise — not for aesthetics. Color encodes status (pass / fail / at-risk). Typography keeps dense data tables scannable without feeling cluttered.

Logo Design

A mark built for
precision, not personality.

Concept

A geometric mark referencing a lens aperture — communicating measurement and validation, not photography.

Form Design Reasoning

Grid-anchored geometry over decorative form — data integrity over expression.

Brand Alignment

An internal tool for QA engineers and PMs — not end users. Restrained and functional by design.

System Blue

Trust · Reliability · Microsoft-tone

Ink Dark

Structure · Data density · Authority

Surface Light

Clarity · Breathing room · Hierarchy

Impact

Fewer screens.
Faster decisions. Less guesswork.

Metrics estimated from workflow observation and PM review. Formal usability testing planned for Q2 2027.

70%

Estimated reduction in cross-device comparison time per validation cycle.

Workflow observation

1

Screen to reach metric-level results — down from 3+ screens in the legacy tool.

IA restructure

80%

Of the fragmented workflow steps consolidated into a single structured platform.

PM-confirmed

ML

Mark Lin

Program Manager · Microsoft

"

Kochakorn demonstrated strong system thinking and quickly understood complex technical constraints, translating them into a clear, structured interface that improved metric visibility and usability. Her ability to simplify dense data views while maintaining accuracy was notable — as was her approach combining product mindset with execution discipline throughout the project.

Reflection

What this project
sharpened.

Simplicity is a functional requirement

In enterprise validation tools, every extra click scales into real errors. Reducing cognitive load isn't a preference — it's the most critical thing the interface can do.

Constraints make research stronger

N=1 access forced every insight to be cross-validated before informing a decision. The constraint raised the bar — findings had to earn their way in, not just appear.

What's next.

Moderated usability tests

Run sessions with 3–5 engineers to validate the cross-device comparison flow and metric interpretation against real tasks.

Accessibility audit

Review the color-coded status system against WCAG contrast standards — critical where color encodes pass/fail decisions across 80+ metrics.

Allinonetool

Redesigning how camera quality is evaluated at scale.

Coordinates vendor communication and final quality reviews.

Runs camera tests and reviews device quality metrics.

A pipeline this criticalwas running on manual steps.

Data was visible.Decision-making was still manual.

Four findings.Three product goals.

Prioritize critical signals

Make comparison the default

Add context to the workflow

Four flows. Every decision mapped.

Four screens.One sequential workflow.

Structure beforevisual design.

Four decisions.Each traced to a finding.

Built for data density,not decoration.

A mark built forprecision, not personality.

Fewer screens.Faster decisions. Less guesswork.

What this projectsharpened.